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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Murphy, Patricia D. 

Allen, Antonette C. 
Alvares, Christopher P. 
Critz, Brenda S. 
Olson, Sheri J. 
Thurber, Denise 
Zeng, Bin 



(ii) TITLE OF INVENTION: Coding Sequences of the Human 
BRCA1 Gene 

(iii) NUMBER OF SEQUENCES: 72 

Uv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howrey & Simon 

(B) STREET: 12 99 Pennsylvania Avenue N. W. 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Albert P. Halluin 

(B) REGISTRATION NUMBER: 25,227 

(C) REFERENCE / DOCKET NUMBER : 5371. 34. US 00 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-463-8109 

(B) TELEFAX: 65 0 -4 63 - btC 0 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 5711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: BRCA1 (omil) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 17 

(B) MAP POSITION: 17q21 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



AGCTCGCTGA 


GACTTCCTGG 


ACCCCGCACC 


AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 




CCTGCGCTCA 


GGAGGCCTTC 


ACCCTCTGCT 


CTGGGTAAAG TTCATTGGAA CAGAAAGAAA 




TGGATTTATC 


TGCTCTTCGC 


GTTGAAGAAG 


TACAAAATGT CATTAATGCT ATGCAGAAAA 




TCTTAGAGTG 


TCCCATCTGT 


CTGGAGTTGA 


TCAAGGAACC TGTCTCCACA AAGTGTGACC 




ACATATTTTG 


CAAATTTTGC 


ATGCTGAAAC 


TTCTCAACCA GAAGAAAGGG CCTTCACAGT 




GTC CTTTATG 


TAAGAATGAT 


ATAACCAAAA 


GGAGCCTACA AGAAAGTACG AGATTTAGTC 




AACTTGTTGA 


AGAGCTATTG 


AAAATCATTT 


GTGCTTTTCA GCTTGACACA GGTTTGGAGT 




ATGCAAACAG 


CTATAATTTT 


GCAAAAAAGG 


AAAATAACTC TCCTGAACAT CTAAAAGATG 




AAGTTTCTAT 


CAT C C AAAGT 


ATGGGCTACA 


GAAACCfTTC^P raafiBflarTT 1 PTAPnnTvnnin 


540 


AACCCGAAAA 


TCCTTCCTTG 


CAGGAAACCA 


GTCTCAGTGT CCAACTCTCT AACCTTGGAA 


600 


CTGTGAGAAC 


TCTGAGGACA AAGCAGCGGA 


TACAACCTCA AAAGACGTCT GTCTACATTG 


660 


AATTGGGATC 


TGATTCTTCT 


GAAGATACCG 


TTAATAAGGC AACTTATTGC AGTGTGGGAG 


720 


ATCAAGAATT 


GTTACAAATC 


ACCCCTCAAG 


GAACCAGGGA TGAAATCAGT TTGGATTCTG 


780 


CAAAAAAGGC 


TGCTTGTGAA 


TTTTCTGAGA 


CGGATGTAAC AAATACTGAA CATCATCAAC 


840 


CCAGTAATAA 


TGATTTGAAC 


.ACCACTGAGA 


AGCGTGCAGC TGAGAGGCAT CCAGAAAAGT 


900 


ATCAGGGTAG 


TTCTGTTTCA 


AACTTGCATG 


TGGAGCCATG TGGCACAAAT ACTCATGCCA 


960 


GCTCATTACA 


GCATGAGAAC 


AGCAGTT TAT 


TACTCACTAA AGACAGAATG AATGTAGAAA 


1020 


AGGCTGAATT 


CTGTAATAAA 


AGCAAACAGC 


CTGGCTTAGC AAGGAGCCAA CATAACAGAT 


1080 


GGGCTGGAAG 


TAAGGAAACA 


TGTAATGATA 


GGCGGACTCC CAGCACAGAA AAAAAGGTAG 


1140 


ATCTGAATGC 


TGATCCCCTG 


TGTGAGAGAA 


AAGAATGGAA TAAGCAGAAA CTGCCATGCT 


1200 


CAGAGAATCC 


TAGAGATACT 


GAAGATGTTC 


CTTGGATAAC ACTAAATAGC AGCATTCAGA 


1260 



45 



Patent 

Attorney Docket No. 5371.34.US01 

AAGTTAATGA GTGGTTTTCC AGAAGTGATG AACTGTTAGG TTCTGATGAC TCACATGATG 13 2 0 

GGGAGT CTGA ATCAAATGCC AAAGTAGCTG ATGTATTGGA CGTTCTAAAT GAGGTAGATG 13 8 0 

AATATTCTGG TTCTTCAGAG AAAATAGACT TACTGGCCAG TGATCCTCAT GAGGCTTTAA 144 0 

TATGTAAAAG TGAAAGAGTT CACTCCAAAT CAGTAGAGAG TAATATTGAA GACAAAATAT 15 0 0 

TTGGGAAAAC CTATCGGAAG AAGGCAAGCC TCCCCAACTT AAGCCATGTA ACTGAAAATC 1560 

TAATTATAGG AGCATTTGTT ACTGAGCCAC AGATAATACA AGAGCGTCCC CTCACAAATA 162 0 

AATTAAAGCG TAAAAGGAGA CCTACATCAG GCCTTCATCC TGAGGATTTT ATCAAGAAAG 1680 

CAGATTTGGC AGTTCAAAAG ACTCCTGAAA TGATAAATCA GGGAACTAAC CAAACGGAGC 1740 

AGAATGGTCA AGTGATGAAT ATTACTAATA GTGGTCATGA GAATAAAACA AAAGGTGATT 18 00 

CTATTCAGAA TGAGAAAAAT CCTAACCCAA TAGAATCACT CGAAAAAGAA TCTGCTTTCA 1860 

AAACGAAAGC TGAACCTATA AGCAGCAGTA TAAGCAATAT GGAACTCGAA TTAAATATCC 192 0 

AC AATT C AAA AGCACCTAAA AAGAATAGGC TGAGGAGGAA GTCTTCTACC AGGCATATTC 19 80 

ATGCGCTTGA ACTAGTAGTC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTGCAAA 2040 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA ATGCCAGTCA 2100 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GCCAAGAAGA 2160 

GTAACAAGCC AAATGAACAG ACAAGTAAAA GACATGACAG TGATACTTTC CCAGAGCTGA 2220 

AGT TAAC AAA TGCACCTGGT TCTTTTACTA AGTGTTCAAA TACCAGTGAA CTTAAAGAAT 22 80 

TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 2340 

CTAATAATGC TGAAGACCCC AAAGATCTCA TGTTAAGTGG AGAAAGGGTT TTGCAAACTG 24 00 

AAAGATCTGT AGAGAGTAGC AGTATTT C AC TGGTACCTGG TACTGATTAT GGCACTCAGG 24 60 

AAAGTATCTC GTTACTGGAA GTTAGCACTC TAGGGAAGGC AAAAACAGAA CCAAATAAAT 252 0 

GTGTGAGTCA GTGTGCAGCA TTTGAAAAC C CCAAGGGACT AATTCATGGT TGTTCCAAAG 2580 

ATAATAGAAA TGACACAGAA GGCTTTAAGT ATCCATTGGG ACATGAAGTT AACCACAGTC 2 64 0 

GGGAAACAAG CATAGAAATG GAAGAAAGTG AACTTGATGC TCAGTATTTG CAGAATACAT 2700 

TCAAGGTTTC AAAGCGCCAG TCATTTGCTC TGTTTTCAAA TCCAGGAAAT GCAGAAGAGG 2 760 

AATGTGCAAC ATTCTCTGCC CACTCTGGGT CCTTAAAGAA ACAAAGTCCA AAAGTCACTT 2 82 0 

TTGAATGTGA ACAAAAGGAA GAAAATCAAG GAAAGAATGA GTCTAATATC AAGCCTGTAC 2 880 

AGACAGTTAA TATCACTGCA GGCTTTCCTG TGGTTGGTCA GAAAGATAAG CCAGTTGATA 2 94 0 

ATGCCAAATG TAGTATCAAA GGAGGCTCTA GGTTTTGTCT ATCATCTCAG TTCAGAGGCA 3 000 

ACGAAACTGG ACTCATTACT CCAAATAAAC ATGGACTTTT ACAAAACCCA TATCGTATAC 3 060 
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CACCACTTTT 


TCCCATCAAG 


TCATTTGTTA 


AAACTAAATG 


TAAGAAAAAT 


CTGCTAGAGG 


3120 


AAAACTTTGA 


GGAACATTCA 


ATGTCACCTG 


AAAGAGAAAT 


GGGAAATGAG 


AACATTCCAA 


3180 


GTACAGTGAG 


CACAATTAGC 


CGTAATAACA 


TTAGAGAAAA 


TGTTTTTAAA 


GGAGCCAGCT 


3240 


CAAGCAATAT 


TAATGAAGTA 


GGTTCCAGTA 


CTAATGAAGT 


GGGCTCCAGT 


ATTAATGAAA 


3300 


TAGGTTCCAG 


TGATGAAAAC 


ATTCAAGCAG 


AACTAGGTAG 


AAACAGAGGG 


CCAAAATTGA 


3360 


ATGCTATGCT 


TAGATTAGGG 


GTTTTGCAAC 


CTGAGGTCTA 


TAAACAAAGT 


CTTCCTGGAA 


3420 


GTAATTGTAA 


GCATCCTGAA 


ATAAAAAAGC 


AAGAATATGA 


AGAAGTAGTT 


CAGACTGTTA 


3480 


ATACAGATTT 


CTCTCCATAT 


CTGATTTCAG 


ATAACTTAGA 


ACAGCCTATG 


GGAAGTAGTC 


3540 


ATGCATCTCA 


GGTTTGTTCT 


GAGACACCTG 


ATGACCTGTT 


AGATGATGGT 


GAAATAAAGG 


3600 


AAGATACTAG 


TTTTGCTGAA 


AATGACATTA 


AGGAAAGTTC 


TGCTGTTTTT 


AGCAAAAGCG 


3660 


TCCAGAGAGG 


AGAGCTTAGC 


AGGAGTCCTA 


GCCCTTTCAC 


C CAT AC AC AT 


TTGGCTCAGG 


3720 


GTTACCGAAG 


AGGGGCCAAG 


AAATTAGAGT 


CCTCAGAAGA 


GAACTTATCT 


AGTGAGGATG 


3780 


AAGAGCTTCC 


CTGCTTCCAA 


CACTTGTTAT 


TTGGTAAAGT 


AAACAATATA 


CCTTCTCAGT 


3840 


CTACTAGGCA 


TAGCACCGTT 


GCTACCGAGT 


GTCTGTCTAA 


GAACACAGAG 


GAGAATTTAT 


3900 


TATCATTGAA 


GAATAGCTTA 


AATGACTGCA 


GTAACCAGGT 


AATATTGGCA 


AAGGCATCTC 


3960 


AGGAACATCA 


CCTTAGTGAG 


GAAACAAAAT 


GTTCTGCTAG 


CTTGTTTTCT 


TCACAGTGCA 


4020 


GTGAATTGGA 


AGACTTGACT 


GCAAATACAA 


ACACCCAGGA 


TCCTTTCTTG 


ATTGGTTCTT 


4080 


CCAAACAAAT 


GAGGCATCAG 


TCTGAAAGCC 


AGGGAGTTGG 


TCTGAGTGAC 


AAGGAATTGG 


4140 


TTTCAGATGA 


TGAAGAAAGA 


GGAACGGGCT 


TGGAAGAAAA 


TAATCAAGAA 


GAGCAAAGCA 


4200 


TGGATTCAAA 


CTTAGGTGAA 


GCAGCATCTG 


GGTGTGAGAG 


TGAAACAAGC 


GTCTCTGAAG 


4260 


ACTGCTCAGG 


GCTATCCTCT 


CAGAGTGACA 


TTTTAACCAC 


TCAGCAGAGG 


GAT AC CATGC 


4320 


AACATAACCT 


GATAAAG CTC 


CAGCAGGAAA 


TGGCTGAACT 


AGAAGCTGTG 


TTAGAACAGC 


4380 


ATGGGAGCCA 


GCCTTCTAAC 


AGCTACCCTT 


CCATCATAAG 


TGACTCCTCT 


GCCCTTGAGG 


4440 


ACCTGCGAAA 


TCCAGAACAA 


AGCACATCAG 


AAAAAGCAGT 


ATTAACTTCA 


CAGAAAAGTA 


4500 


GTGAATACCC 


TATAAGCCAG 


AATCCAGAAG 


GCCTTTCTGC 


TGACAAGTTT 


GAGGTGTCTG 


4560 


CAGATAGTTC 


TACCAGTAAA 


AATAAAGAAC 


CAGGAGTGGA 


AAGGTCATCC 


CCTTCTAAAT 


4620 


GC C CAT CATT 


AGATGATAGG 


TGGTACATGC 


ACAGTTGCTC 


TGGGAGTCTT 


CAGAATAGAA 


4680 


ACTACCCATC 


T C AAGAG& AG 


CTCATTAAGG 


TTGTTGATGT 


GGAGGAGCAA 


CAGCTGGAAG 


4740 


AGTCTGGGCC 


ACACGATTTG 


ACGGAAACAT 


CTTACTTGCC 


AAGGCAAGAT 


CTAGAGGGAA 


4800 


CCCCTTACCT 


GGAATCTGGA 


ATCAGCCTCT 


TCTCTGATGA 


CCCTGAATCT 


GATCCTTCTG 


4860 


AAGACAGAGC 


CCCAGAGTCA 


GCTCGTGTTG 


GCAACATACC 


ATCTTCAACC 


TCTGCATTGA 


4920 
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AAGTTCCCCA ATTGAAAGTT GCAGAATCTG CCCAGGGTCC AGCTGCTGCT CATACTACTG 4 98 0 

ATACTGCTGG GTATAATGCA ATGGAAGAAA GTGTGAGCAG GGAGAAGCCA GAATTGACAG 5040 

CTTCAACAGA AAGGGTCAAC AAAAGAATGT CCATGGTGGT GTCTGGCCTG ACCCCAGAAG 5100 

AATTTATGCT CGTGTACAAG TTTGCCAGAA AACACCACAT CACTTTAACT AATCTAATTA 5160 

CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 522 0 

TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 52 80 

AGTCTATTAA AGAAAGAAAA ATGCTGAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 5340 

TCAATGGAAG AAACCACCAA GGTCCAAAGC GAGCAAGAGA ATCCCAGGAC AGAAAGATCT 54 0 0 

TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGCCCACA GATCAACTGG 54 60 

AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 5520 

GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 55 80 

TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 564 0 

GTGTAGCACT CTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 57 00 

GCCACTACTG A 5711 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1863 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: BRCA1 (omil) ' 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 17 

(B) MAP POSITION: 17q21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Asp Leu Ser Ala Leu Arg Val Glu Glu Val Gin Asn Val lie Asn 

1 5 10 «- 15 

Ala Met Gin Lys lie Leu Glu Cys Pro lie Cys Leu Glu Leu lie Lys 

20 25 30 

Glu Pro Val Ser Thr Lys Cys Asp His lie Phe Cys Lys Phe Cys Met 

35 40 45 

Leu Lys Leu Leu Asn Gin Lys Lys Gly Pro Ser Gin Cys Pro Leu Cys 
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Lys Asn Asp lie Thr 
65 

Gin Leu Val Glu Glu 
85 

Thr Gly Leu Glu Tyr 
100 

Asn Ser Pro Glu His 
115 

Gly Tyr Arg Asn Arg 
130 

Pro Ser Leu Gin Glu 
145 

Thr Val Arg Thr Leu 
165 

Ser Val Tyr lie Glu 
180 

Lys Ala Thr Tyr Cys 
195 

Pro Gin Gly Thr Arg 
210 

Ala Cys Glu Phe Ser 
225 

Pro Ser Asn Asn Asp 
245 

His Pro Glu Lys Tyr 
260 

Pro Cys Gly Thr Asn 
275 

Ser Leu Leu Leu Thr 
290 

Cys Asn Lys Ser Lys 
305 

Trp Ala Gly Ser Lys 
325 

Glu Lys Lys Val Asp 
340 

Trp Asn Lys Gin Lys 
355 

Asp Val Pro Trp lie 
370 



55 

Lys Arg Ser Leu Gin Glu 
70 75 

Leu Leu Lys lie lie Cys 
90 

Ala Asn Ser Tyr Asn Phe 
105 

Leu Lys Asp Glu Val Ser 
120 

Ala Lys Arg Leu Leu Gin 
135 

Thr Ser Leu Ser Val Gin 
150 155 

Arg Thr Lys Gin Arg lie 
170 

Leu Gly Ser Asp Ser Ser 
185 

Ser Val Gly Asp Gin Glu 
200 

Asp Glu lie Ser Leu Asp 
215 

Glu Thr Asp Val Thr Asn 
230 235 

Leu Asn Thr Thr Glu Lys 
250 

Gin Gly Ser Ser Val Ser 
265 

Thr His Ala Sir Ser Leu 
280 

Lys Asp Arg Met Asn Val 
295 

Gin Pro Gly Leu Ala Arg 
310 315 

Glu Thr Cys Asn Asp Arg 
330 

Leu Asn Ala Asp Pro Leu 
345 

Leu Pro Cys Ser Glu Asn 
360 

Thr Leu Asn Ser Ser lie 
375 



60 

Ser Thr Arg Phe Ser 
80 

Ala Phe Gin Leu Asp 
95 

Ala Lys Lys Glu Asn 
110 

lie lie Gin Ser Met 
125 

Ser Glu Pro Glu Asn 
140 

Leu Ser Asn Leu Gly 
160 

Gin Pro Gin Lys Thr 
175 

Glu Asp Thr Val Asn 
190 

Leu Leu Gin lie Thr 
205 

Ser Ala Lys Lys Ala 
220 

Thr Glu His Kis Gin 
240 

Arg Ala Ala Glu Arg 
255 

Asn Leu His Val Glu 
270 

Gin His Glu Asn Ser 
285 

Glu Lys Ala Glu Phe 
300 

Ser Gin His Asn Arg 
320 

Arg Thr Pro Ser Thr 
335 

Cys Glu Arg Lys Glu 
350 

Pro Arg Asp Thr Glu 
365 

Gin Lys Val Asn Glu 
380 
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Trp Phe Ser Arg 
385 

Gly Glu Ser Glu 



Asn Glu Val Asp 
420 

Ala Ser Asp Pro 
435 

Ser Lys Ser Val 
450 

Tyr Arg Lys Lys 
465 

Leu He He Gly 



Pro Leu Thr Asn 
500 

His Pro Glu Asp 
515 

Pro Glu Met He 
530 

Val Met Asn He 
545 

Ser He Gin Asn 



Glu Ser Ala Phe 
580 

Asn Met Glu Leu 
595 

Asn Arg Leu Arg 
610 

Leu Val Val Ser 
625 

He Asp Ser Cys 



Gin Met Pro Val 
660 

Glu Pro Ala Thr 
675 

Ser Lys Arg His 
690 

Ala Pro Gly Ser 



Ser Asp Glu Leu 
390 

Ser Asn Ala Lys 
405 

Glu Tyr Ser Gly 



His Glu Ala Leu 
440 

Glu Ser Asn He 
455 

Ala Ser Leu Pro 
470 

Ala Phe Val Thr 
485 

Lys Leu Lys Arg 



Phe He Lys Lys 
520 

Asn Gin Gly Thr 
535 

Thr Asn Ser Gly 
550 

Glu Lys Asn Pro 
565 

Lys Thr Lys Ala 

Glu Leu Asn He 
600 

Arg Lys Ser Ser 
615 

Arg Asn Leu Ser 
630 

Ser Ser Ser Glu 
645 

Arg His Ser Arg 



Gly Ala Lys Lys 
680 

Asp Ser Asp Thr 
695 

Phe Thr Lys Cys 



Leu Gly Ser Asp 

395 

Val Ala Asp Val 
410 

Ser Ser Glu Lys 
425 

He Cys Lys Ser 



Glu Asp Lys He 
460 

Asn Leu Ser His 
475 

Glu Pro Gin He 
490 

Lys Arg Arg Pro 
505 

Ala Asp Leu Ala 



Asn Gin Thr Glu 
540 

His Glu Asn Lys 
555 

Asn Pro He Glu 
570 

Glu Pro He Ser 
585 

His Asn Ser Lys 



Thr Arg His He 
620 

Pro Pro Asn Cys 
635 

Glu He Lys Lys 
650 

Asn Leu Gin Leu 
665 

Cer Asn Lys Pro 



Phe Pro Glu Leu 
700 

Ser Asn Thr Ser 



Asp Ser His Asp 

400 

Leu Asp Val Leu 
415 

He Asp Leu Leu 
430 

Glu Arg Val His 
445 

Phe Gly Lys Thr 



Val Thr Glu Asn 
480 

He Gin Glu Arg 
495 

Thr Ser Gly Leu 
510 

Val Gin Lys Thr 
525 

Gin Asn Gly Gin 



Thr Lys Gly Asp 
560 

Ser Leu Glu Lys 
575 

Ser Ser He Ser 
590 

Ala Pro Lys Lys 
605 

His Ala Leu Glu 



Thr Glu Leu Gin 
640 

Lys Lys Tyr Asn 
655 

Met Glu Gly Lys 
670 

Asn Glu Gin Thr 
685 

Lys Leu Thr Asn 



Glu Leu Lys Glu 
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Phe Val Asn Pro Ser Leu Pro Arg Glu Glu Lys Glu Glu Lys Leu Glu 
725 730 735 

Thr Val Lys Val Ser Asn Asn Ala Glu Asp Pro Lys Asp Leu Met Leu 
740 745 750 

Ser Gly Glu Arg Val Leu Gin Thr Glu Arg Ser Val Glu Ser Ser Ser 
755 760 765 

lie Ser Leu Val Pro Gly Thr Asp Tyr Gly Thr Gin Glu Ser lie Ser 
770 775 780 

Leu Leu Glu Val Ser Thr Leu Gly Lys Ala Lys Thr Glu Pro Asn Lys 
785 790 795 800 

Cys Val Ser Gin Cys Ala Ala Phe Glu Asn Pro Lys Gly Leu lie His 
805 810 815 

Gly Cys Ser Lys Asp Asn Arg Asn Asp Thr Glu Gly Phe Lys Tyr Pro 
820 825 830 

Leu Gly His Glu Val Asn His Ser Arg Glu Thr Ser lie Glu Met Glu 
835 840 845 

Glu Ser Glu Leu Asp Ala Gin Tyr Leu Gin Asn Thr Phe Lys Val Ser 
850 855^ 860 

Lys Arg Gin Ser Phe Ala Leu Phe Ser Asn Pro Gly Asn Ala Glu Glu 
865 870 -' 875 880 

Glu Cys Ala Thr Phe Ser Ala His Ser Gly Ser Leu Lys Lys Gin Ser 

885 890 895 

Pro Lys Val Thr Phe Glu Cys Glu Gin Lys Glu Glu Asn Gin Gly Lys 
900 905 910 

Asn Glu Ser Asn He Lys Pro Val Gin Thr Val Asn He Thr Ala Gly 
915 920 925 

Phe Pro Val Val Gly Gin Lys Asp Lys Pro Val Asp Asn Ala Lys Cys 
930 935 ' 940 

Ser He Lys Gly Gly Ser Arg Phe Cys Leu Ser Ser Gin Phe Arg Gly 
945 950 955 960 

Asn Glu Thr Gly Leu He Thr Pro Asn Lys His Gly Leu Leu Gin Asn 
965 970 975 

Pro Tyr Arg He Pro Pro Leu Phe Pro He Lys Ser Phe Val Lys Thr 
980 985 990 

Lys Cys Lys Lys Asn Leu Leu Glu Glu Asn Phe Glu Glu His Ser Met 
995 1000 1005 

Ser Pro Glu Arg Glu Met Gly Asn Glu Asn He Pro Ser Thr Val Ser 
1010 1015 1020 

Thr He Ser Arg Asn Asn He Arg Glu Asn Val Phe Lys Gly Ala Ser 
1025 1030 1035 „ 1040 
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Ser Ser Asn lie Asn Glu Val Gly Ser Ser Thr Asn Glu Val Gly Ser 
1045 1050 1055 

Ser lie Asn Glu lie Gly Ser Ser Asp Glu Asn lie Gin Ala Glu Leu 
1060 1065 1070 

Gly Arg Asn Arg Gly Pro Lys Leu Asn Ala Met Leu Arg Leu Gly Val 
1075 1080 1085 

Leu Gin Pro Glu Val Tyr Lys Gin Ser Leu Pro Gly Ser Asn Cys Lys 
1090 1095 1100 

His Pro Glu lie Lys Lys Gin Glu Tyr Glu Glu Val Val Gin Thr Val 
1105 1110 1115 1120 

Asn Thr Asp Phe Ser Pro Tyr Leu lie Ser Asp Asn Leu Glu Gin Pro 
1125 1130 1135 

Met Gly Ser Ser His Ala Ser Gin Val Cys Ser Glu Thr Pro Asp Asp 
1140 1145 1150 

Leu Leu Asp Asp Gly Glu lie Lys Glu Asp Thr Ser Phe Ala Glu Asn 
1155 1160 1165 

Asp He Lys Glu Ser Ser Ala Val Phe Ser Lys Ser Val Gin (Arg Gly 
1170 H75 1180 

Glu Leu Ser Arg Ser Pro Ser Pro Phe Thr His Thr His Leu Ala Gin 
1185 1190 1195 1200 

Gly Tyr Arg Arg Gly Ala Lys Lys Leu Glu Ser Ser Glu Glu Asn Leu 
1205 1210 1215 

Ser Ser Glu Asp Glu Glu Leu Pro Cys Phe Gin His Leu Leu Phe Gly 
1220 1225 1230 

Lys Val Asn Asn He Pro Ser Gin Ser Thr Arg His Ser Thr Val Ala 
1235 1240 1245 

Thr Glu Cys Leu Ser Lys Asn Thr Glu Glu Asn Leu Leu Ser Leu Lys 
1250 1255 1260 

Asn Ser Leu Asn Asp Cys Ser Asn Gin Val He Leu Ala Lys Ala Ser 
1265 1270 1275 1280 

Gin Glu His His Leu Ser Glu Glu Thr Lys Cys Ser Ala Ser Leu Phe 
1285 1290 1295 

Ser Ser Gin Cys Ser Glu Leu Glu Asp Leu Thr Ala Asn Thr Asn Thr 
1300 1305 1310 

Gin Asp Pro Phe Leu He Gly Ser Ser Lys Gin Met Arg His Gin Ser 
1315 1320 1325 

Glu Ser Gin Gly Val Gly Leu Ser Asp Lys Glu Leu Val Ser Asp Asp 
1330 1335 1340 

Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
1345 1350 1355 1360 

Met Asp Ser Asn Leu Gly Glu Ala Ala Ser Gly Cys Glu Ser Glu Thr 
1365 1370 1375 



52 



Patent 

Attorney Docket No. 5371.34.US01 



Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp lie Leu 
1380 1385 1390 

Thr Thr Gin Gin Arg Asp Thr Met Gin His Asn Leu lie Lys Leu Gin 
1395 1400 1405 

Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 
1410 1415 1420 



Pro Ser Asn Ser Tyr Pro Ser lie 
1425 1430 

Asp Leu Arg Asn Pro Glu Gin Ser 
1445 



lie Ser Asp Ser Ser Ala Leu Glu 
1435 1440 

Thr Ser Glu Lys Ala Val Leu Thr 
1450 1455 



Ser Gin Lys Ser Ser Glu Tyr Pro lie Ser Gin Asn Pro Glu Gly Leu 
1460 1465 1470 

Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 
1475 1480 1485 

Lys Glu Pro Gly Val Glu Arg Ser Ser Pro Ser Lys Cys Pro Ser Leu 

1490 1495 1500 

Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 
1505 1510 1515 1520_ 

Asn Tyr Pro Ser Gin Glu Glu Leu lie Lys Val Val Asp Val Glu Glu 
1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 
1540 1545 1550 

Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly lie 
1555 1560 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 
1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn lie Pro Ser Ser Thr Ser Ala Leu 
1585 1590 1595 1600 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Gly' Pro Ala Ala 
1605 1610 ~ 5 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 
1620 1625 1630 

Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn Lys 
1635 1640 1645 

Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 
1650 1655 1660 

Val Tyr Lys Phe Ala Arg Lys His His lie Thr Leu Thr Asn Leu lie 
1665 1670 1675 1680 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 
1685 1690 1695 

Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly lie Ala Gly Gly Lys Trp 
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1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr Gin Ser lie Lys Glu Arg Lys Met 
1715 1720 1725 

Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 
1730 1735 1740 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys lie 
1745 1750 1755 1760 

Phe Arg Gly Leu Glu lie Cys Cys Tyr Gly Pro Phe Thr Asn Met Pro 
1765 1770 1775 

Thr Asp Gin Leu Glu Trp Met Val Gin Leu Cys Gly Ala Ser Val Val 
1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro lie Val 

1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala lie 
1810 1815 1820 

Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
1825 1830 1835 1840 

Ser Val Ala Leu Tyr Gin Cys Gin Glu Leu Asp Thr Tyr Leu lie Pro 
1845 1850 1855 



Gin lie Pro His Ser His Tyr 
1860 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA ' 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN : BRCA1 (omi2) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: 17 

(B) MAP POSITION: 17q21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 6 0 

CCTGCGCTCA GGAGGCCTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAAA 12 0 

TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT ATGCAGAAAA 18 0 
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TCTTAGAGTG 


TCCCATCTGT 


CTGGAGTTGA 


TCAAGGAACC 


TGTCTCCACA 


AAGTGTGACC 


240 


ACATATTTTG 


CAAATTTTGC 


ATGCTGAAAC 


TTCTCAACCA 


GAAGAAAGGG 


CCTTCACAGT 


300 


GTCCTTTATG 


TAAGAATGAT 


ATAACCAAAA 


GGAGCCTACA 


AGAAAGTACG 


AGATTTAGTC 


360 


AACTTGTTGA 


AGAGCTATTG 


AAAATCATTT 


GTGCTTTTCA 


GCTTGACACA 


GGTTTGGAGT 


420 


ATGCAAACAG 


CTATAATTTT 


GCAAAAAAGG 


AAAATAACTC 


TCCTGAACAT 


CTAAAAGATG 


480 


AAGTTTCTAT 


CATCCAAAGT 


ATGGGCTACA 


GAAACCGTGC 


CAAAAGACTT 


CTACAGAGTG 


540 


AACCCGAAAA 


TCCTTCCTTG 


CAGGAAACCA 


GTCTCAGTGT 


CCAACTCTCT 


AACCTTGGAA 


600 


CTGTGAGAAC 


TCTGAGGACA 


AAGCAGCGGA 


TACAAC CTCA 


AAAGACGTCT 


GTCTACATTG 


660 


AATTGGGATC 


TGATTCTTCT 


GAAGATACCG 


TTAATAAGGC 


AACTTATTGC 


AGTGTGG GAG 


720 


ATCAAGAATT 


GTTACAAATC 


ACCCCTCAAG 


GAACCAGGGA 


TGAAATCAGT 


TTGGATTCTG 


780 


CAAAAAAGGC 


TGCTTGTGAA 


TTTTCTGAGA 


CGGATGTAAC 


AAATACTGAA 


CATCATCAAC 


840 


CCAGTAATAA 


TGATTTGAAC 


ACCACTGAGA 


AGCGTGCAGC 


TGAGAGGCAT 


CCAGAAAAGT 


900 


ATCAGGGTAG 


TTCTGTTTCA 


AACTTGCATG 


TGGAGCCATG 


TGGCACAAAT 


ACTCATGCCA 


960 


GCTCATTACA 


GCATGAGAAC 


AGCAGTTTAT 


TACTCACTAA 


AGACAGAATG 


AATGTAGAAA 


1020 


AGGCTGAATT 


CTGTAATAAA 


AGCAAACAGC 


CTGGCTTAGC 


AAGGAGCCAA 


CATAACAGAT 


1080 


GGGCTGGAAG 


TAAGGAAACA 


TGTAATGATA 


GGCGGACTCC 


CAGCACAGAA 


AAAAAGGTAG 


1140 


ATCTGAATGC 


TGATCCCCTG 


TGTGAGAGAA 


AAGAATGGAA 


TAAGCAGAAA 


CTGCCATGCT 


1200 


CAGAGAATCC 


TAGAGATACT 


GAAGATGTTC 


CTTGGATAAC 


ACTAAATAGC 


AGCATTCAGA 


1260 


AAGTTAATGA 


GTGGTTTTCC 


AGAAGTGATG 


AACTGTTAGG 


TTCTGATGAC 


TCACATGATG 


1320 


GGGAGT CTGA 


ATCAAATGCC 


AAAGTAGCTG 


ATGTATTGGA 


CGTTCTAAAT 


GAGGTAGATG 


1380 


AATATT CTGG 


TTCTTCAGAG 


AAAATAGACT 


TACTGfecCAG TGATCCTCAT GAGGCTTTAA 


1440 


TATGTAAAAG 


TGAAAGAGTT 


CACTCCAAAT 


CAGTAGAGAG 


TAATATTGAA 


GACAAAATAT 


1500 


TTGGGAAAAC 


CTATCGGAAG 


AAGGCAAGCC 


TCCCCAACTT 


AAGCCATGTA 


ACTGAAAATC 


1560 


TAATTATAGG 


AGCATTTGTT 


ACTGAGC CAC 


AGATAATACA AGAGCGTCCC 


CTCACAAATA 


1620 


AATTAAAGCG 


TAAAAGGAGA 


CCTACATCAG 


GCCTTCATCC 


TGAGGATTTT 


ATCAAGAAAG 


1680 


CAGATTTGGC 


AGTTCAAAAG 


ACTCCTGAAA 


TGATAAATCA 


GGGAACTAAC 


CAAACGGAGC 


1740 


AGAATGGTCA 


AGTGATGAAT 


ATTACTAATA 


GTGGTCATGA 


GAATAAAACA 


AAAGGTGATT 


1800 


CTATTCAGAA 


TGAGAAAAAT 


CCTAACCCAA 


TAGAATCACT 


CGAAAAAGAA 


TCTGCTTTCA 


1860 


AAAC GAAAGC 


TGAACCTATA 


AGCAGCAGTA 


TAAGCAATAT 


GGAACTCGAA 


TTAAATATCC 


1920 


ACAATTCAAA 


AGCACCTAAA 


AAGAATAGGC 


TGAGGAGGAA 


GTCTTCTACC 


AGGCATATTC 


1980 
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ATGCGCTTGA ACTAGTAGTC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTG C AAA 2 04 0 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA ATGCCAGTCA 210 0 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GCCAAGAAGA 2160 

GTAACAAGCC AAATGAACAG ACAAGTAAAA GACATGACAG TGATACTTTC CCAGAGCTGA 222 0 

AGTTAACAAA TGCACCTGGT TCTTTTACTA AGTGTTCAAA T AC C AGTGAA CTTAAAGAAT 22 80 

TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 234 0 

CTAATAATGC TGAAGACCCC AAAGATCTCA TGTTAAGTGG AGAAAGGGTT TTGCAAACTG 24 0 0 

AAAGATCTGT AGAGAGTAGC AGTATTTCAC TGGTACCTGG TACTGATTAT GGCACTCAGG 2460 

AAAGTATCTC GTTACTGGAA GTTAGCACTC TAGGGAAGGC AAAAACAGAA CCAAATAAAT 2 52 0 

GTGTGAGTCA GTGTGCAGCA TTTGAAAACC CCAAGGGACT AATTCATGGT TGTTCCAAAG 2580 

ATAATAGAAA TGACACAGAA GGCTTTAAGT ATCCATTGGG ACATGAAGTT AACCACAGTC 264 0 

GGGAAACAAG CATAGAAATG GAAGAAAGTG AACTTGATGC TCAGTATTTG CAGAATACAT 2 70 0 

TCAAGGTTTC AAAGCGCCAG TCATTTGCTC TGTTTTCAAA TCCAGGAAAT GCAGAAGAGG 2760 

AATGTGCAAC ATTCTCTGCC CACTCTGGGT CCTTAAAGAA ACAAAGTCCA AAAGTCACTT 2 82 0 

TTGAATGTGA ACAAAAGGAA GAAAATCAAG GAAAGAATGA GTCTAATATC AAGCCTGTAC 2 88 0 

AGACAGTTAA TATCACTGCA GGCTTTCCTG TGGTTGGTCA GAAAGATAAG CCAGTTGATA 2340 

ATGC CAAATG TAGTATCAAA GGAGGCTCTA GGTTTTGTCT ATCATCTCAG TTCAGAGGCA 3 00 0 

ACGAAACTGG ACTCATTACT CCAAATAAAC ATGGACTTTT AC AAAAC C C A TATCGTATAC 3 06 0 

CACCACTTTT TCCCATCAAG TCATTTGTTA AAACTAAATG TAAGAAAAAT CTGCTAGAGG 312 0 

AAAACTTTGA GGAACATT C A ATGTCACCTG AAAGAGAAAT GGGAAATGAG AACATTCCAA 318 0 

GTACAGTGAG CACAATTAGC CGTAATAACA TTAGAGAAAA TGTTTTTAAA GGAGCCAGCT 324 0 

1 

CAAGCAATAT TAATGAAGTA GGTTCCAGTA CTAATGAAGT GGGCTCCAGT ATTAATGAAA 33 0 0 

TAGGTTCCAG TGATGAAAAC ATTCAAGCAG AACTAGGTAG AAACAGAGGG CCAAAATTGA 3360 

ATGCTATGCT TAGATTAGGG GTTTTGCAAC CTGAGGTCTA TAAACAAAGT CTTCCTGGAA 342 0 

GTAATTGTAA GCATCCTGAA ATAAAAAAGC AAGAATATGA AGAAGTAGTT CAGACTGTTA 348 0 

ATACAGATTT CTCTCCATAT CTGATTTCAG ATAACTTAGA ACAGCCTATG GGAAGTAGTC 354 0 

ATGCATCTCA GGTTTGTTCT GAGACACCTG ATGACCTGTT AGATGATGGT GAAATAAAGG 3600 

AAGATACTAG TTTTGCTGAA AATGACATTA AGGAAAGTTC TGCTGTTTTT AGCAAAAGCG 366 0 

TCCAGAGAGG AGAGCTTAGC AGGAGTCCTA GCCCTTTCAC C CATACAC AT TTGGCTCAGG 372 0 

GTTAC CGAAG AGGGGCCAAG AAATTAGAGT CCTCAGAAGA GAACTTATCT AGTGAGGATG 378 0 

AAGAGCTTCC CTGCTTCCAA CACTTGTTAT TTGGTAAAGT AAACAATATA CCTTCTCAGT 3 84 0 
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CTACTAGGCA TAGCACCGTT GCTAC CGAGT GTCTGTCTAA GAACACAGAG GAGAATTTAT 3900 

TATCATTGAA GAATAGCTTA AATGACTGCA GTAACCAGGT AATATTGGCA AAGGCATCTC 3 960 

AGGAACATCA CCTTAGTGAG GAAACAAAAT GTTCTGCTAG CTTGTTTTCT TCACAGTGCA 402 0 

GTGAATTGGA AGACTTGACT GCAAATACAA ACACC CAGGA TCCTTTCTTG ATTGGTTCTT 40 8 0 

CCAAACAAAT GAGGCATCAG TCTGAAAGCC AGGGAGTTGG TCTGAGTGAC AAG GAATTGG 414 0 

TTT CAGATGA TGAAGAAAGA GGAACGGGCT TGGAAGAAAA TAATCAAGAA GAGCAAAGCA 42 0 0 

TGGATTCAAA CTTAGGTGAA GCAGCATCTG GGTGTGAGAG TGAAACAAGC GTCTCTGAAG 42 60 

ACTGCTCAGG GCTATCCTCT CAGAGTGACA TTTTAACCAC TCAGCAGAGG GATACCATGC 432 0 

AACATAACCT GATAAAGCTC CAGCAGGAAA TGGCTGAACT AGAAGCTGTG TTAGAACAGC 43 8 0 

ATGGGAGCCA GCCTTCTAAC AGCTACCCTT CCATCATAAG TGACTCTTCT GCCCTTGAGG 444 0 

ACCTGCGAAA TCCAGAACAA AGCACATCAG AAAAAGCAGT ATTAACTTCA CAGAAAAGTA 45 0 0 

GTGAATACCC TATAAGCCAG AATCCAGAAG GCCTTTCTGC TGACAAGTTT GAGGTGTCTG 4560 

CAGATAGTTC TACCAGTAAA AATAAAGAAC CAGGAGTGGA AAGGTCATCC CCTTCTAAAT 4620 

GCCCATCATT AGATGATAGG TGGTACATGC ACAGTTGCTC TGGGAGTCTT CAGAATAGAA 4 68 0 

ACTACCCATC TCAAGAGGAG CTCATTAAGG. TTGTTGATGT GGAGGAGCAA CAGCTGGAAG 474 0 

AGTCTGGGCC ACACGATTTG ACGGAAACAT CTTACTTGCC AAGGCAAGAT CTAGAGGGAA 4 80 0 

CCCCTTACCT GGAATCTGGA ATCAGCCTCT TCTCTGATGA CCCTGAATCT GATCCTTCTG 4 86 0 

AAGACAGAGC CCCAGAGTCA GCTCGTGTTG GCAACAT AC C ATCTTCAACC TCTGCATTGA 4 92 0 

AAGTTCCCCA ATTGAAAGTT GCAGAATCTG CCCAGGGTCC AGCTGCTGCT CATACTACTG 4 98 0 

ATACTGCTGG GTATAATGCA ATGGAAGAAA GTGTGAGCAG GGAGAAGCCA GAATTGACAG 5 04 0 

CTTCAACAGA AAGGGTCAAC AAAAGAATGT CCATGGTGGT GTCTGGCCTG ACCCCAGAAG 5100 

AATTTATGCT CGTGTACAAG TTTGCCAGAA AACACCACAT CACTTTAACT AATCTAATTA 5160 

CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 522 0 

TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 52 80 

AGTCTATTAA AGAAAGAAAA ATGCTGAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 534 0 

TCAATGGAAG AAACCACCAA GGTC CAAAGC GAGCAAGAGA ATCCCAGGAC AGAAAGATCT 540 0 

TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGCCCACA GATCAACTGG 5460 

AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 552 0 

GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 5 580 

TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 5 640 



57 



Patent 

Attorney Docket No. 5371.34.US01 

GTGTAGCACT GTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 5700 
GCCACTACTG A 5711 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1863 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: BRCA1 (omi2) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 17 

(B) MAP POSITION: 17q21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Asp Leu Ser Ala Leu Arg Val Glu Glu Val Gin Asn Val lie Asn 
15 10 15 

Ala Met Gin Lys lie Leu Glu Cys Pro lie Cys Leu Glu Leu lie Lys 
20 25 30 

Glu Pro Val Ser Thr Lys Cys Asp His lie Phe Cys Lys Phe Cys Met 
35 40 45 

Leu Lys Leu Leu Asn Gin Lys Lys Gly Pro Ser Gin Cys Pro Leu Cys 
50 55 60 

Lys Asn Asp lie Thr Lys Arg Ser Leu Gin Glu Ser Thr Arg Phe Ser 
65 70 75 80 

Gin Leu Val Glu Glu Leu Leu Lys lie lie Cys Ala Phe Gin Leu Asp 
85 90 95 

Thr Gly Leu Glu Tyr Ala Asn Ser Tyr Asn Phe Ala Lys Lys Glu Asn 
100 105 110 

Asn Ser Pro Glu His Leu Lys Asp Glu Val Ser lie lie Gin Ser Met 
115 120 125 

Gly Tyr Arg Asn Arg Ala Lys Arg Leu Leu Gin Ser Glu Pro Glu Asn 
130 135 140 

Pro Ser Leu Gin Glu Thr Ser Leu Ser Val Gin Leu Ser Asn Leu Gly 
145 150 155 160 

Thr Val Arg Thr Leu Arg Thr Lys Gin Arg lie Gin Pro Gin Lys Thr 
165 170 175 

Ser Val Tyr lie Glu Leu Gly Ser Asp Ser Ser Glu Asp Thr Val Asn 
180 185 190 
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Lys Ala Thr Tyr 
195 

Pro Gin Gly Thr 
210 

Ala Cys Glu Phe 
225 

Pro Ser Asn Asn 



His Pro Glu Lys 
260 

Pro Cys Gly Thr 
275 

Ser Leu Leu Leu 
290 

Cys Asn Lys Ser 
305 

Trp Ala Gly Ser 



Glu Lys Lys Val 
340 

Trp Asn Lys Gin 
355 

Asp Val Pro Trp 
370 

Trp Phe Ser Arg 
385 

Gly Glu Ser Glu 



Asn Glu Val Asp 
420 

Ala Ser Asp Pro 
435 

Ser Lys Ser Val 
450 

Tyr Arg Lys Lys 
465 

Leu He He Gly 



Pro Leu Thr Asn 
500 



Cys Ser Val Gly 
200 

Arg Asp Glu He 
215 

Ser Glu Thr Asp 
230 

Asp Leu Asn Thr 
245 

Tyr Gin Gly Ser 



Asn Thr His Ala 
280 

Thr Lys Asp Arg 
295 

Lys Gin Pro Gly 
310 

Lys Glu Thr Cys 
325 

Asp Leu Asn Ala 



Lys Leu Pro Cys 
360 

He Thr Leu Asn 
375 

Ser Asp Glu Leu 
390 

Ser Asn Ala Lys 
405 

Glu Tyr Ser Gly 



His Glu Ala Leu 
440 

Glu Ser Asn He 
455 

Ala Ser Leu Pro 
470 

Ala Phe Val Thr 
485 

Lys Leu Lys Arg 



Asp Gin Glu Leu 



Ser Leu Asp Ser 
220 

Val Thr Asn Thr 
235 

Thr Glu Lys Arg 
250 

Ser Val Ser Asn 
265 

Ser Ser Leu Gin 



Met Asn Val Glu 
300 

Leu Ala Arg Ser 
315 

Asn Asp Arg Arg 
330 

Asp Pro Leu Cys 
345 

Ser Glu Asn Pro 



Ser Ser He Gin 
380 

Leu Gly Ser Asp 
395 

Val Ala Asp Val 
* 410 

Ser Ser Glu Lys 
425 

He Cys Lys Ser 



Glu Asp Lys He 
460 

Asn Leu Ser His 
475 

Glu Pro Gin He 
490 

Lys Arg Arg Pro 
505 



Leu Gin He Thr 
205 

Ala Lys Lys Ala 



Glu His His Gin 
240 

Ala Ala Glu Arg 
255 

Leu His Val Glu 
270 

His Glu Asn Ser 
285 

Lys Ala Glu Phe 



Gin His Asn Arg 
320 

Thr Pro Ser Thr 
335 

Glu Arg Lys Glu 
350 

Arg Asp Thr Glu 
365 

Lys Val Asn Glu 



Asp Ser His Asp 
400 

Leu Asp Val Leu 
415 

He Asp Leu Leu 
430 

Glu Arg Val His 
445 

Phe Gly Lys Thr 



Val Thr Glu Asn 
480 

He Gin Glu Arg 
495 

Thr Ser Gly Leu 
510 
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His Pro Glu Asp Phe lie Lys Lys Ala Asp Leu Ala Val Gin Lys Thr 
515 520 525 

Pro Glu Met lie Asn Gin Gly Thr Asn Gin Thr Glu Gin Asn Gly Gin 
530 535 540 

Val Met Asn lie Thr Asn Ser Gly His Glu Asn Lys Thr Lys Gly Asp 
545 550 555 560 

Ser lie Gin Asn Glu Lys Asn Pro Asn Pro lie Glu Ser Leu Glu Lys 
565 570 575 

Glu Ser Ala Phe Lys Thr Lys Ala Glu Pro lie Ser Ser Ser lie Ser 

580 585 590 

Asn Met Glu Leu Glu Leu Asn lie His Asn Ser Lys Ala Pro Lys Lys 
595 600 605 

Asn Arg Leu Arg Arg Lys Ser Ser Thr Arg His lie His Ala Leu Glu 
610 615 620 

Leu Val Val Ser Arg Asn Leu Ser Pro Pro Asn Cys Thr Glu Leu Gin 
625 630 635 640 

lie Asp Ser Cys Ser Ser Ser Glu Glu lie Lys Lys Lys Lys Tyr Asn 
645 650 655 

Gin Met Pro Val Arg His Ser Arg Asn Leu Gin Leu Met Glu Gly Lys 
660 665 670 

Glu Pro Ala Thr Gly Ala Lys Lys Ser Asn Lys Pro Asn Glu Gin Thr 
675 680 685 

Ser Lys Arg His Asp Ser Asp Thr Phe Pro Glu Leu Lys Leu Thr Asn 
690 695 700 

Ala Pro Gly Ser Phe Thr Lys Cys Ser Asn Thr Ser Glu Leu Lys Glu 
705 710 715 720 

Phe Val Asn Pro Ser Leu Pro Arg Glu Glu Lys Glu Glu Lys Leu Glu 
725 730 735 

Thr Val Lys Val Ser Asn Asn Ala G?u Asp Pro Lys Asp Leu Met Leu 
740 745 750 

Ser Gly Glu Arg Val Leu Gin Thr Glu Arg Ser Val Glu Ser Ser Ser 
755 760 765 

lie Ser Leu Val Pro Gly Thr Asp Tyr Gly Thr Gin Glu Ser lie Ser 
770 775 780 

Leu Leu Glu Val Ser Thr Leu Gly Lys Ala Lys Thr Glu Pro Asn Lys 
785 790 795 800 

Cys Val Ser Gin Cys Ala Ala Phe Glu Asn Pro Lys Gly Leu lie His 
805 810 815 

Gly Cys Ser Lys Asp Asn Arg Asn Asp Thr Glu Gly Phe Lys Tyr Pro 
820 825 830 

Leu Gly His Glu Val Asn His Ser Arg Glu Thr Ser lie Glu Met Glu 
835 840 845 
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Glu Ser Glu Leu Asp Ala Gin Tyr Leu Gin. Asn Thr Phe Lys Val Ser 
850 855 860 

Lys Arg Gin Ser Phe Ala Leu Phe Ser Asn Pro Gly Asn Ala Glu Glu 
865 870 875 880 

Glu Cys Ala Thr Phe Ser Ala His Ser Gly Ser Leu Lys Lys Gin Ser 

885 890 895 

Pro Lys Val Thr Phe Glu Cys Glu Gin Lys Glu Glu Asn Gin Gly Lys 
900 905 910 

Asn Glu Ser Asn lie Lys Pro Val Gin Thr Val Asn lie Thr Ala Gly 
915 920 925 

Phe Pro Val Val Gly Gin Lys Asp Lys Pro Val Asp Asn Ala Lys Cys 
930 935 940 

Ser lie Lys Gly Gly Ser Arg Phe Cys Leu Ser Ser Gin Phe Arg Gly 
945 950 955 960 

Asn Glu Thr Gly Leu lie Thr Pro Asn Lys His Gly Leu Leu Gin Asn 
965 970 975 

Pro Tyr Arg lie Pro Pro Leu Phe Pro lie Lys Ser Phe Val Lys Thr 
980 985 990 

Lys Cys Lys Lys Asn Leu Leu Glu Glu Asn Phe Glu Glu His Ser Met 
995 1000 1005 

Ser Pro Glu Arg Glu Met Gly Asn Glu Asn lie Pro Ser Thr Val Ser 
1010 1015 1020 

Thr lie Ser Arg Asn Asn lie Arg Glu Asn Val Phe Lys Gly Ala Ser 
1025 1030 1035 1040 

Ser Ser Asn lie Asn Glu Val Gly Ser Ser Thr Asn Glu Val Gly Ser 
1045 1050 1055 

Ser lie Asn Glu lie Gly Ser Ser Asp Glu Asn lie Gin Ala Glu Leu 
1060 1^65 1070 

Gly Arg Asn Arg Gly Pro Lys Leu Asn Ala Met Leu Arg Leu Gly Val 
1075 1080 1085 

Leu Gin Pro Glu Val Tyr Lys Gin Ser Leu Pro Gly Ser Asn Cys Lys 
1090 1095 1100 

His Pro Glu lie Lys Lys Gin Glu Tyr Glu Glu Val Val Gin Thr Val 
1105 1110 1115 1120 

Asn Thr Asp Phe Ser Pro Tyr Leu lie Ser Asp Asn Leu Glu Gin Pro 
1125 1130 1135 

Met Gly Ser Ser His Ala Ser Gin Val Cys Ser Glu Thr Pro Asp Asp 
1140 1145 1150 

Leu Leu Asp Asp Gly Glu lie Lys Glu Asp Thr Ser Phe Ala Glu Asn 
1155 1160 1165 

Asp lie Lys Glu Ser Ser Ala Val Phe Ser Lys Ser Val Gin Arg Gly 
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1170 1175 1180 

Glu Leu Ser Arg Ser Pro Ser Pro Phe Thr His Thr His Leu Ala Gin 
1185 1190 1195 1200 

Gly Tyr Arg Arg Gly Ala Lys Lys Leu Glu Ser Ser Glu Glu Asn Leu 
1205 1210 1215 

Ser Ser Glu Asp Glu Glu Leu Pro Cys Phe Gin His Leu Leu Phe Gly 
1220 1225 1230 

Lys Val Asn Asn lie Pro Ser Gin Ser Thr Arg His Ser Thr Val Ala 
1235 1240 1245 

Thr Glu Cys Leu Ser Lys Asn Thr Glu Glu Asn Leu Leu Ser Leu Lys 
1250 1255 1260 

Asn Ser Leu Asn Asp Cys Ser Asn Gin Val lie Leu Ala Lys Ala Ser 
1265 1270 1275 1280 

Gin Glu His His Leu Ser Glu Glu Thr Lys Cys Ser Ala Ser Leu Phe 
1285 1290 1295 



Ser Ser Gin Cys Ser Glu Leu Glu Asp Leu Thr Ala Asn Thr Asn Thr 
1300 1305 1310 

Gin Asp Pro Phe Leu lie Gly Ser Ser Lys Gin Met Arg His Gin Ser 
1315 1320 1325 

Glu Ser Gin Gly Val Gly Leu> Ser Asp Lys Glu Leu Val Ser Asp Asp 
1330 1335 1340 



Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
1345 1350 1355 1360 

Met Asp Ser Asn Leu Gly Glu Ala Ala Ser Gly Cys Glu Ser Glu Thr 
1365 1370 1375 



Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp lie Leu 
1380 1385 1390 

Thr Thr Gin Gin Arg Asp Thr Met dn His Asn Leu lie Lys Leu Gin 
1395 1400 1405 



Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 
1410 1415 1420 

Pro Ser Asn Ser Tyr Pro Ser lie lie Ser Asp Ser Ser Ala Leu Glu 
1425 1430 1435 1440 

Asp Leu Arg Asn Pro Glu Gin Ser Thr Ser Glu Lys Ala Val Leu Thr 
1445 1450 1455 

Ser Gin Lys Ser Ser Glu Tyr Pro lie Ser Gin Asn Pro Glu Gly Leu 
1460 1465 1470 

Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 
1475 1480 1485 

Lys Glu Pro Gly Val Glu Arg Ser Ser Pro Ser Lys Cys Pro Ser Leu 
1490 1495 1500 
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Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 
1505 1510 1515 1520 

Asn Tyr Pro Ser Gin Glu Glu Leu lie Lys Val Val Asp Val Glu Glu 
1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 
1540 1545 1550 

Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly lie 
1555 1560 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 
1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn lie Pro Ser Ser Thr Ser Ala Leu 
1585 1590 1595 1600 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Gly Pro Ala Ala 
1605 1610 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 
1620 1625 1630 

Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn. Lys 
1635 1640 1645 



Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 
1650 1655 1660 

Val Tyr Lys Phe Ala Arg Lys His His lie Thr Leu Thr Asn Leu lie 
1665 1670 1675 1680 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 
1685 1690 1695 

Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly lie Ala Gly Gly Lys Trp 
1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr gLi Ser lie Lys Glu Arg Lys Met 
1715 1720 1725 

Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 
1730 1735 1740 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys lie 
1745 1750 1755 1760 

Phe Arg Gly Leu Glu lie Cys Cys Tyr Gly Pro Phe Thr Asn Met Pro 
1765 1770 1775 

Thr Asp Gin Leu Glu Trp Met Val Gin Leu Cys Gly Ala Ser Val Val 
1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro He Val 

1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala He 
1810 1815 1820 
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Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
1825 1830 1835 1840 

Ser Val Ala Leu Tyr Gin Cys Gin Glu Leu. Asp Thr Tyr Leu lie Pro 
1845 1850 1855 

Gin lie Pro His Ser His Tyr 
1860 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: BRCA1 (omi3) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 17 
<B) MAP POSITION: 17q21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 60 

CCTGCGCTCA GGAGGCCTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAAA 12 0 

TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT ATGCAGAAAA 18 0 

TCTTAGAGTG TCCCATCTGT CTGGAGTTGA TCAAGGAACC TGTCTCCACA AAGTGTGACC 24 0 

ACATATTTTG CAAATTTTGC ATGCTGAAAC TTCTCAACCA GAAGAAAGGG CCTTCACAGT 300 

GTCCTTTATG TAAGAATGAT ATAACCAAAA GGAGCCTACA AGAAAGTACG AGATTTAGTC 360 

AACTTGTTGA AGAGCTATTG AAAATCATTT GTGCTTTTCA GCTTGACACA GGTTTGGAGT 42 0 

ATGCAAACAG CTATAATTTT GCAAAAAAGG AAAATAACTC TCCTGAACAT CTAAAAGATG 48 0 

AAGTTTCTAT CATCCAAAGT ATGGGCTACA GAAACCGTGC CAAAAGACTT CTACAGAGTG 54 0 

AACCCGAAAA TCCTTCCTTG CAGGAAACCA GTCTCAGTGT CCAACTCTCT AACCTTGGAA 60 0 

CTGTGAGAAC TCTGAGGACA AAGCAGCGGA TACAACCTCA AAAGACGTCT GTCTACATTG 660 

AATTGGGATC TGATTCTTCT GAAGATACCG TTAATAAGGC AACTTATTGC AGTGTGGGAG 72 0 

AT C AAGAATT GTTACAAATC ACCCCTCAAG GAACCAGGGA TGAAATCAGT TTGGATTCTG 780 

CAAAAAAGGC TGCTTGTGAA TTTTCTGAGA CGGATGTAAC AAATACTGAA CATCATCAAC 840 

CCAGTAATAA TGATTTGAAC ACCACTGAGA AGCGTGCAGC TGAGAGGCAT CCAGAAAAGT 900 
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AT C AGGGT AG TTCTGTTTCA AACTTGCATG TGGAGCCATG TGGCACAAAT ACTCATGCCA 960 

GCTCATTACA GCATGAGAAC AGCAGTTTAT TACTCACTAA AGACAGAATG AATGTAGAAA 102 0 

AGGCTGAATT CTGTAATAAA AGCAAACAGC CTGGCTTAGC AAGGAGCCAA CATAACAGAT 1080 

GGGCTGGAAG TAAGGAAACA TGTAATGATA GGCGGACTCC CAGCACAGAA AAAAAGGTAG 1140 

AT CTGAATGC TGATCCCCTG TGTGAGAGAA AAGAATGGAA TAAG C AG AAA CTGCCATGCT 1200 

CAGAGAATCC TAGAGATACT GAAGATGTTC CTTGGATAAC ACTAAATAGC AGCATTCAGA 1260 

AAGTTAATGA GTGGTTTTCC AGAAGTGATG AACTGTTAGG TTCTGATGAC TCACATGATG 13 20 

GGGAGTCTGA ATCAAATGCC AAAGTAGCTG ATGTAT TGGA CGTTCTAAAT GAGGTAGATG 1380 

AATATTCTGG TTCTTCAGAG AAAAT AG AC T TACTGGCCAG TGATCCTCAT GAGGCTTTAA 1440 

TATGTAAAAG TGAAAGAGTT CACTCCAAAT CAGTAGAGAG TAATATTGAA GACAAAATAT 1500 

TTGGGAAAAC CTATCGGAAG AAGGCAAGCC TCCCCAACTT AAGCCATGTA ACTGAAAATC 1560 

TAATTATAGG AGCATTTGTT ACTGAGCCAC AGATAATACA AGAGCGTCCC CTCACAAATA 1620 

AATTAAAGCG TAAAAGGAGA CCTACATCAG GCCTTCATCC TGAGGATTTT ATCAAGAAAG 1680 

CAGATTTGGC AGTTCAAAAG ACTCCTGAAA T GATAAATC A GGGAACTAAC CAAACGGAGC 174 0 

AGAATGGTCA AGTGATGAAT ATTACTAATA GTGGTCATGA GAATAAAACA AAAGGTGATT 1800 

CTATTCAGAA TGAGAAAAAT CCTAACCCAA TAGAATCACT CGAAAAAGAA TCTGCTTTCA 1860 

AAACGAAAGC TGAACCTATA AGCAGCAGTA TAAGCAATAT GGAACTCGAA TTAAATATCC 192 0 

ACAATTCAAA AGC ACC T AAA AAGAATAGGC TGAGGAGGAA GTCTTCTACC AGGCATATTC 198 0 

ATGCGCTTGA ACTAGTAGTC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTGCAAA 2 04 0 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA ATGCCAGTCA 210 0 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GCCAAGAAGA 216 0 

GTAACAAGCC AAATGAACAG ACAAGTAAAA GACATGACAG CGATACTTTC CCAGAGCTGA 222 0 

AGTTAACAAA TGCACCTGGT TCTTTTACTA AGTGTT CAAA TACCAGTGAA CTTAAAGAAT 22 8 0 

TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 2 34 0 

CTAATAATGC TGAAGACCCC AAAGATCTCA TGTTAAGTGG AGAAAGGGTT TTGCAAACTG 2400 

AAAGATCTGT AGAGAGTAGC AGTATTTCAT TGGTAC CTGG TACTGATTAT GGCACTCAGG 246 0 

AAAGTATCTC GTTACTGGAA GTTAGCACTC TAGGGAAGGC AAAAACAGAA CCAAATAAAT 2 52 0 

GTGTGAGTCA GTGTGCAGCA TTTGAAAACC CCAAGGGACT AATTCATGGT TGTTCCAAAG 2580 

ATAATAGAAA TGACACAGAA GGCTTTAAGT ATCCATTGGG ACATGAAGTT AACCACAGTC 2 64 0 

GGGAAACAAG CATAGAAATG GAAGAAAGTG AACTTGATGC TCAGTATTTG CAGAATACAT 2 70 0 
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TCAAGGTTTC AAAGCGCCAG TCATTTGCTC TGTTTTCAAA TCCAGGAAAT GCAGAAGAGG 2 7 60 

AATGTGCAAC ATTCTCTGCC CACTCTGGGT CCTTAAAGAA ACAAAGTCCA AAAGTCACTT 2 820 

TTGAATGTGA ACAAAAGGAA GAAAATCAAG GAAAGAATGA GTCTAATATC AAGCCTGTAC 2 8 80 

AGACAGTTAA TATCACTGCA GGCTTTCCTG TGGTTGGTCA GAAAGATAAG CCAGTTGATA 2 94 0 

ATGC CAAATG TAGTATCAAA GGAGGCTCTA GGTTTTGTCT ATCATCTCAG TTCAGAGGCA 3 000 

ACGAAACTGG ACTCATTACT CCAAATAAAC ATGGACTTTT ACAAAACCCA TATCGTATAC 3 060 

CACCACTTTT TCCCATCAAG TCATTTGTTA AAACTAAATG TAAGAAAAAT CTGCTAGAGG 312 0 

AAAACTTTGA GGAACATTCA ATGTCACCTG AAAGAGAAAT GGGAAATGAG AACATTCCAA 3180 

GTACAGTGAG CACAATTAGC CGTAATAACA TTAGAGAAAA TGTTTTTAAA GAAGCCAGCT 3 240 

CAAGCAATAT TAATGAAGTA GGTTCCAGTA CTAATGAAGT GGGCTCCAGT ATTAATGAAA 33 00 

TAGGTTCCAG TGATGAAAAC ATTCAAGCAG AACTAGGTAG AAACAGAGGG CCAAAATTGA 3360 

ATGCTATGCT TAGATTAGGG GTTTTGCAAC CTGAGGTCTA TAAACAAAGT CTTCCTGGAA 3420 

GTAATTGTAA GCATCCTGAA ATAAAAAAGC AAGAATATGA AGAAGTAGTT CAGACTGTTA 3 4 80 

ATACAGATTT CTCTCCATAT CTGATTTCAG ATAACTTAGA ACAGCCTATG GGAAGTAGTC 3 540 

ATGCATCTCA GGTTTGTTCT GAGACACCTG ATGACCTGTT AGATGATGGT GAAATAAAGG 3 600 

AAGATACTAG TTTTGCTGAA AATGACATTA AGGAAAGTT C TGCTGTTTTT AGCAAAAGCG 3 660 

TCCAGAAAGG AGAGCTTAGC AGGAGTCCTA GCCCTTTCAC CCATACACAT TTGGCTCAGG 3 72 0 

GTTACCGAAG AGGGGCCAAG AAATTAGAGT CCTCAGAAGA GAACTTATCT AGTGAGGATG 3780 

AAGAGCTTCC CTGCTTCCAA CACTTGTTAT TTGGTAAAGT AAACAATATA CCTTCTCAGT 3 840 

CTACTAGGCA TAGCACCGTT GCTACCGAGT GTCTGTCTAA GAACACAGAG GAGAATTTAT 3 900 

TATCATTGAA GAATAGCTTA AATGACTGCA GTAACC.AGGT AATATTGGCA AAGGCATCTC 3 960 

AGGAACATCA CCTTAGTGAG GAAACAAAAT GTTCTGCTAG CTTGTTTTCT TCACAGTGCA 4 02 0 

GTGAATTGGA AGACTTGACT GCAAATACAA ACACCCAGGA TCCTTTCTTG ATTGGTTCTT 4 0 80 

CCAAACAAAT GAGGCATCAG TCTGAAAGCC AGGGAGTTGG TCTGAGTGAC AAGGAATTGG 414 0 

TTTCAGATGA TGAAGAAAGA GGAACGGGCT TGGAAGAAAA TAATCAAGAA GAGCAAAGCA 420 0 

TGGATTCAAA CTTAGGTGAA GCAGCATCTG GGTGTGAGAG TGAAACAAGC GTCTCTGAAG 42 60 

ACTGCTCAGG GCTATCCTCT CAGAGTGACA TTTTAACCAC TCAGCAGAGG GATACCATGC 432 0 

AACATAACCT GATAAAGCTC CAGCAGGAAA TGGCTGAACT AGAAGCTGTG TTAGAACAGC 43 8 0 

ATGGGAGCCA GCCTTCTAAC AGCTACCCTT C CAT C AT AAG TGACTCTTCT GCCCTTGAGG 444 0 

ACCTGCGAAA TCCAGAACAA AGCACATCAG AAAAAGCAGT ATTAACTTCA CAGAAAAGTA 45 00 

GTGAATACCC TATAAGCCAG AATCCAGAAG GCCTTTCTGC TGACAAGTTT GAGGTGTCTG 4560 
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CAGATAGTTC TAC C AGT AAA AATAAAGAAC CAGGAGTGGA AAGGTCATCC CCTTCTAAAT 4 62 0 

GCCCATCATT AGATGATAGG TGGTACATGC ACAGTTGCTC TGGGAGTCTT CAGAATAGAA 4 68 0 

ACTACCCATC TCAAGAGGAG CTCATTAAGG TTGTTGATGT GGAGGAGCAA CAGCTGGAAG 4 74 0 

AGTCTGGGCC ACACGATTTG AC GGAAAC AT CTTACTTGCC AAGGCAAGAT CTAGAGGGAA 4 80 0 

CCCCTTACCT GGAATCTGGA ATCAGCCTCT TCTCTGATGA CCCTGAATCT GATCCTTCTG 4 8 60 

AAGACAGAGC CCCAGAGTCA GCTCGTGTTG GCAACATACC ATCTTCAACC TCTGCATTGA 4 92 0 

AAGTTCCCCA ATTGAAAGTT GCAGAATCTG CCCAGAGTCC AGCTGCTGCT CATACTACTG 4 98 0 

ATACTGCTGG GTATAATGCA ATGGAAGAAA GTGTGAGCAG GGAGAAGCCA GAATTGACAG 5 04 0 

CTTCAACAGA AAGGGTCAAC AAAAGAATGT CCATGGTGGT GTCTGGCCTG ACCCCAGAAG 5100 

AATTTATGCT CGTGTACAAG TTTGC CAGAA AACACCACAT CACTTTAACT AATCTAATTA 5160 

CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 522 0 

TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 5280 

AGTCTATTAA AGAAAGAAAA ATGCTGAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 534 0 

TCAATGGAAG AAACCACCAA GGTCCAAAGC GAGCAAGAGA ATC CCAGGAC AGAAAGATCT 540 0 

TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGCCCACA GATCAACTGG 5460 

AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 5 52 0 

GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 5580 

TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 5640 

GTGTAGCACT CTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 570 0 

GCCACTACTG A 5 711 
(2) INFORMATION FOR SEQ ID NO : 6 : * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1863 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN : BRCA1 (omi3) 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 17 

(B) MAP POSITION: 17q21 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met Asp Leu Ser Ala Leu Arg Val Glu Glu Val Gin Asn Val lie Asn 
15 10 15 

Ala Met Gin Lys lie Leu Glu Cys Pro lie Cys Leu Glu Leu lie Lys 
20 25 30 

Glu Pro Val Ser Thr Lys Cys Asp His lie Phe Cys Lys Phe Cys Met 
35 40 45 

Leu Lys Leu Leu Asn Gin Lys Lys Gly Pro Ser Gin Cys Pro Leu Cys 
50 55 60 

Lys Asn Asp lie Thr Lys Arg Ser Leu Gin Glu Ser Thr Arg Phe Ser 
65 70 75 80 

Gin Leu Val Glu Glu Leu Leu Lys lie lie Cys Ala Phe Gin Leu Asp 
85 90 95 

Thr Gly Leu Glu Tyr Ala Asn Ser Tyr Asn Phe Ala Lys Lys Glu Asn 
100 105 110 

Asn Ser Pro Glu His Leu Lys Asp Glu Val Ser lie lie Gin Ser Met 
115 120 125 

Gly Tyr Arg Asn Arg Ala Lys Arg Leu Leu Gin Ser Glu Pro Glu Asn 
130 135 140 

Pro Ser Leu Gin Glu Thr Ser Leu Ser Val Gin Leu Ser Asn Leu Gly 
145 150 155 160 

Thr Val Arg Thr Leu Arg Thr Lys Gin Arg lie Gin Pro Gin Lys Thr 
165 170 175 

Ser Val Tyr lie Glu Leu Gly Ser Asp Ser Ser Glu Asp Thr Val Asn 
180 185 190 

Lys Ala Thr Tyr Cys Ser Val Gly Asp Gin Glu Leu Leu Gin lie Thr 
195 200 205 

Pro Gin Gly Thr Arg Asp Glu lie sir Leu Asp Ser Ala Lys Lys Ala 
210 215 220 

Ala Cys Glu Phe Ser Glu Thr Asp Val Thr Asn Thr Glu His His Gin 
225 230 235 240 

Pro Ser Asn Asn Asp Leu Asn Thr Thr Glu Lys Arg Ala Ala Glu Arg 
245 250 255 

His Pro Glu Lys Tyr Gin Gly Ser Ser Val Ser Asn Leu His Val Glu 
260 265 270 

Pro Cys Gly Thr Asn Thr His Ala Ser Ser Leu Gin His Glu Asn Ser 
27b 280 285 

Ser Leu Leu Leu Thr Lys Asp Arg Met Asn Val Glu Lys Ala Glu Phe 
290 295 300 

Cys Asn Lys Ser Lys Gin Pro Gly Leu Ala Arg Ser Gin His Asn Arg 
305 310 315 320 
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Trp Ala Gly Ser 



Glu Lys Lys Val 
340 

Trp Asn Lys Gin 
355 

Asp Val Pro Trp 
370 

Trp Phe Ser Arg 
385 

Gly Glu Ser Glu 



Asn Glu Val Asp 
420 

Ala Ser Asp Pro 
435 

Ser Lys Ser Val 
450 

Tyr Arg Lys Lys 
465 

Leu lie lie Gly 



Pro Leu Thr Asn 
500 

His Pro Glu Asp 
515 

Pro Glu Met He 
530 

Val Met Asn He 
545 

Ser He Gin Asn 



Glu Ser Ala Phe 
580 

Asn Met Glu Leu 
595 

Asn Arg Leu Arg 
610 

Leu Val Val Ser 
625 

He Asp Ser Cys 



Lys Glu Thr Cys 
325 

Asp Leu Asn Ala 



Lys Leu Pro Cys 
360 

He Thr Leu Asn 
375 

Ser Asp Glu Leu 
390 

Ser Asn Ala Lys 
405 

Glu Tyr Ser Gly 



His Glu Ala Leu 
440 

Glu Ser Asn He 
455 

Ala Ser Leu Pro 
470 

Ala Phe Val Thr 
485 

Lys Leu Lys Arg 



Phe He Lys Lys 
520 

Asn Gin Gly Thr 
535 

Thr Asn Ser Gly 
550 

Glu Lys Asn Pro 
565 

Lys Thr Lys Ala 

Glu Leu Asn He 
600 

Arg Lys Ser Ser 
615 

Arg Asn Leu Ser 
630 

Ser Ser Ser Glu 



Asn Asp Arg Arg 
330 

Asp Pro Leu Cys 
345 

Ser Glu Asn Pro 



Ser Ser He Gin 
380 

Leu Gly Ser Asp 
395 

Val Ala Asp Val 
410 

Ser Ser Glu Lys 
425 

He Cys Lys Ser 



Glu Asp Lys He 
460 

Asn Leu Ser His 
475 

Glu Pro Gin He 
490 

Lys Arg Arg Pro 
505 

Ala Asp Leu Ala 



Asn Gin Thr Glu 
540 

His Glu Asn Lys 
555 

Asn Pro He Glu 
570 

Glu Pro He Ser 
585 

His Asn Ser Lys 



Thr Arg His He 
620 

Pro Pro Asn Cys 
635 

Glu He Lys Lys 



Thr Pro Ser Thr 
335 

Glu Arg Lys Glu 
350 

Arg Asp Thr Glu 
365 

Lys Val Asn Glu 



Asp Ser His Asp 
400 

Leu Asp Val Leu 
415 

He Asp Leu Leu 
430 

Glu Arg Val His 
445 

Phe Gly Lys Thr - 



Val Thr Glu Asn 
480 

He Gin Glu Arg 
495 

Thr Ser Gly Leu 
510 

Val Gin Lys Thr 
525 

Gin Asn Gly Gin 



Thr Lys Gly Asp 
560 

Ser Leu Glu Lys 
575 

Ser Ser He Ser 
590 

Ala Pro Lys Lys 
605 

His Ala Leu Glu 



Thr Glu Leu Gin 
640 

Lys Lys Tyr Asn 



69 



Patent 

Attorney Docket No. 5371.34.US01 



Gin Met Pro Val Arg His Ser Arg Asn Leu Gin Leu Met Glu Gly Lys 
660 665 670 

Glu Pro Ala Thr Gly Ala Lys Lys Ser Asn Lys Pro Asn Glu Gin Thr 
675 680 685 

Ser Lys Arg His Asp Ser Asp Thr Phe Pro Glu Leu Lys Leu Thr Asn 
690 695 700 

Ala Pro Gly Ser Phe Thr Lys Cys Ser Asn Thr Ser Glu Leu Lys Glu 
705 710 715 720 

Phe Val Asn Pro Ser Leu Pro Arg Glu Glu Lys Glu Glu Lys Leu Glu 
725 730 735 

Thr Val Lys Val Ser Asn Asn Ala Glu Asp Pro Lys Asp Leu Met Leu 
740 745 750 

Ser Gly Glu Arg Val Leu Gin Thr Glu Arg Ser Val Glu Ser Ser Ser 
755 760 765 

lie Ser Leu Val Pro Gly Thr Asp Tyr Gly Thr Gin Glu Ser lie Ser 
770 775 780 

Leu Leu Glu Val Ser Thr Leu Gly Lys Ala Lys Thr Glu Pro Asn Lys 
785 790 795 800 

Cys Val Ser Gin Cys Ala Ala Phe Glu Asn Pro Lys Gly Leu lie His 
805 810 815 

Gly Cys Ser Lys Asp Asn Arg Asn Asp Thr Glu Gly Phe Lys Tyr Pro 
820 825 830 

Leu Gly His Glu Val Asn His Ser Arg Glu Thr Ser lie Glu Met Glu 
835 840 845 

Glu Ser Glu Leu Asp Ala Gin Tyr Leu Gin Asn Thr Phe Lys Val Ser 
850 855 860 

Lys Arg Gin Ser Phe Ala Leu Phe Ser Asn Pro Gly Asn Ala Glu Glu 
865 870 875 880 

Glu Cys Ala Thr Phe Ser Ala His Ser Gly Ser Leu Lys Lys Gin Ser 

885 890 895 

Pro Lys Val Thr Phe Glu Cys Glu Gin Lys Glu Glu Asn Gin Gly Lys 
900 905 910 

Asn Glu Ser Asn lie Lys Pro Val Gin Thr Val Asn lie Thr Ala Gly 
915 920 925 

Phe Pro Val Val Gly Gin Lys Asp Lys Pro Val Asp Asn Ala Lys Cys 
930 935 940 

Ser lie Lys Gly Gly Ser Arg Phe Cys Leu Ser Ser Gin Phe Arg Gly 
945 950 955 960 

Asn Glu Thr Gly Leu lie Thr Pro Asn Lys His Gly Leu Leu Gin Asn 
965 970 975 



70 



Patent 

Attorney Docket No. 5371.34.US01 



Pro Tyr Arg lie Pro Pro Leu Phe Pro lie Lys Ser Phe Val Lys Thr 
980 985 990 

Lys Cys Lys Lys Asn Leu Leu Glu Glu Asn Phe Glu Glu His Ser Met 

995 1000 1005 

Ser Pro Glu Arg Glu Met Gly Asn Glu Asn lie Pro Ser Thr Val Ser 
1010 1015 1020 

Thr lie Ser Arg Asn Asn lie Arg Glu Asn Val Phe Lys Glu Ala Ser 
1025 1030 1035 1040 

Ser Ser Asn lie Asn Glu Val Gly Ser Ser Thr Asn Glu Val Gly Ser 
1045 1050 1055 

Ser lie Asn Glu lie Gly Ser Ser Asp Glu Asn lie Gin Ala Glu Leu 
1060 1065 1070 

Gly Arg Asn Arg Gly Pro Lys Leu Asn Ala Met Leu Arg Leu Gly Val 
1075 1080 1085 

Leu Gin Pro Glu Val Tyr Lys Gin Ser Leu Pro Gly Ser Asn Cys Lys 
1090 1095 1100 

His Pro Glu lie Lys Lys Gin Glu Tyr Glu Glu Val Val Gin Thr Val 
1105 1110 1115 1120- 

Asn Thr Asp Phe Ser Pro Tyr Leu lie Ser Asp Asn Leu Glu Gin Pro 
1125 1130 1135 

Met Gly Ser Ser His Ala Ser Gin Val Cys Ser Glu Thr Pro Asp Asp 
1140 1145 1150 

Leu Leu Asp Asp Gly Glu lie Lys Glu Asp Thr Ser Phe Ala Glu Asn 
1155 1160 1165 



Asp lie Lys Glu Ser Ser Ala Val 
1170 1175 

Glu Leu Ser Arg Ser Pro Ser Pro 
1185 1190 

Gly Tyr Arg Arg Gly Ala Lys Lys 
1205 

Ser Ser Glu Asp Glu Glu Leu Pro 
1220 



Phe Ser Lys Ser Val Gin Lys Gly 
1180 

Phe Thr His Thr His Leu Ala Gin 
t 1195 1200 

Leu Glu Ser Ser Glu Glu Asn Leu 
1210 1215 

Cys Phe Gin His Leu Leu Phe Gly 
1225 1230 



Lys Val Asn Asn He Pro Ser Gin Ser Thr Arg His Ser Thr Val Ala 
1235 1240 1245 

Thr Glu Cys Leu Ser Lys Asn Thr Glu Glu Asn Leu Leu Ser Leu Lys 
1250 1255 1260 

Asn Ser Leu Asn Asp Cys Ser Asn Gin Val lie Leu Ala Lys Ala Ser 
1265 1270 1275 1280 

Gin Glu His His Leu Ser Glu Glu Thr Lys Cys Ser Ala Ser Leu Phe 
1285 1290 1295 

Ser Ser Gin Cys Ser Glu Leu Glu Asp Leu Thr Ala Asn Thr Asn Thr 
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Gin Asp Pro Phe Leu He Gly Ser Ser Lys Gin Met Arg His Gin Ser 
1315 1320 1325 

Glu Ser Gin Gly Val Gly Leu Ser Asp Lys Glu Leu Val Ser Asp Asp 
1330 1335 1340 

Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
1345 1350 1355 1360 

Met Asp Ser Asn Leu Gly Glu Ala Ala Ser Gly Cys Glu Ser Glu Thr 
1365 1370 1375 

Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp He Leu 
1380 1385 1390 

Thr Thr Gin Gin Arg Asp Thr Met Gin His Asn Leu He Lys Leu Gin 
1395 1400 1405 

Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 
1410 1415 1420 

Pro Ser Asn Ser Tyr Pro Ser He He Ser Asp Ser Ser Ala Leu Glu 
1425 1430 1435 1440 

Asp Leu Arg Asn Pro Glu Gin Ser Thr Ser Glu Lys Ala Val Leu Thr 
1445 1450 1455 

Ser Gin Lys Ser Ser Glu Tyr* Pro He Ser Gin Asn Pro Glu Gly Leu 
1460 1465 1470 

Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 
1475 1480 1485 

Lys Glu Pro Gly Val Glu Arg Ser Ser Pro Ser Lys Cys Pro Ser Leu 
1490 1495 1500 

Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 
1505 1510 1515 1520 

Asn Tyr Pro Ser Gin Glu Glu Leu lie Lys Val Val Asp Val Glu Glu 
1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 
1540 1545 1550 

Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly He 
1555 1560 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 
1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn He Pro Ser Ser Thr Ser Ala Leu 
1585 1590 1595 1600 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Ser Pro Ala Ala 
1605 1610 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 
1620 1625 1630 
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Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn Lys 
1635 1640 1645 

Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 
1650 1655 1660 

Val Tyr Lys Phe Ala Arg Lys His His lie Thr Leu Thr Asn Leu lie 
1665 1670 1675 1680 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 
1685 1690 1695 

Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly lie Ala Gly Gly Lys Trp 
1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr Gin Ser He Lys Glu Arg Lys Met 

1715 1720 1725 

Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 
1730 1735 1740 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys He 
1745 1750 1755 1760 

Phe Arg Gly Leu Glu He Cys Cys Tyr Gly Pro Phe Thr Asn Met Pro - 
1765 1770 1775 

Thr Asp Gin Leu Glu Trp Met Val Gin Leu Cys Gly Ala Ser Val Val 
1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro He Val 
1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala He 
1810 1815 1820 

Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
1825 1830 1835 1840 

Ser Val Ala Leu Tyr Gin Cys Gin GJ.u Leu Asp Thr Tyr Leu He Pro 
1845 1850 1855 

Gin He Pro His Ser His Tyr 
1860 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 2F primer 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAAGTTGTCA TTTTATAAAC CTTT 24 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 2R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
TGTCTTTTCT TCCCTAGTAT GT 22 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 3F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



t 

T C CTGACAC A GCAGACATTT A 21 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 3R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTGGATTTTC GTTCTCACTT A 21 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 5F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTCTTAAGGG CAGTTGTGAG 



(2) INFORMATION FOR SEQ ID NO : 12 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 5R-M13* primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
TTCCTACTGT GGTTGCTTCC 2 0 



(2) INFORMATION FOR SEQ ID NO : 13 : ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(■ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 6/7F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 
CTTATTTTAG TGTCCTTAAA AGG 
(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 
(B) STRAIN: 6R 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TTTCATGGAC AGCACTTGAG TG 22 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 7F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CACAACAAAG AG CAT AC AT A GGG 23 
(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 6/7R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



TCGGGTTCAC TCTGTAGAAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(B) STRAIN : 8F1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TTCTCTTCAG GAGGAAAAGC A 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

<C) STRAND EDNESS : not relevant 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 8R1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
GCTGCCTACC ACAAATACAA A 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 9F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCACAGTAGA TGCT C AGTAA ATA 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(B) STRAIN: 9R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TAGGAAAATA CCAGCTTCAT AGA 2 3 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE : 

(B) STRAIN: 10F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGGTCAGCTT TCTGTAATCG 20 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 10R primer 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTATCTACCC ACTCTCTTCT TCAG 24 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11AF primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCACCTCCAA GGTGTATCA 19 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11AR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TGTTATGTTG GCTCCTTGCT 2 0 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

• (vi) ORIGINAL SOURCE: 

(B) STRAIN: 11BF1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CACTAAAGAC AGAATGAATC TA 22 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11BR1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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GAAGAACCAG AATATTCATC TA 22 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<vi) ORIGINAL SOURCE: 

(B) STRAIN: 11CF1 primer 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TGATGGGGAG TCTGAATCAA 2 0 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11CR1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TCTGCTTTCT TGATAAAATC CT 2 2 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11DF1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AGCGTCCCCT CACAAATAAA 2 0 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE : 

(B) STRAIN: 11DR1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TCAAGCGCAT GAATATGCCT 2 0 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11EF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTATAAGCAA TATGGAACTC GA 22 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: HER primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

TTAAGTTCACT GGTATTTGAA CA 23 

(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



81 



Patent 

Attorney Docket No. 5371.34.US01 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11FF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GACAGCGATA CTTTCCCAGA 2 0 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : not relevant 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11FR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 34 : 
TGGAACAACC ATGAATTAGT C 21 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) ^ 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11GF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGAAGTTAGC ACTCTAGGGA 20 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCAGTGATAT TAACTGTCTG TA '22 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11HF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TGGGTCCTTA AAGAAACAAA GT 22. 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11HR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TCAGGTGACA TTGAATCTTC C 21 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11IF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
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CCACTTTTTC CCATCAAGTC A 21 
(2) INFORMATION FOR SEQ ID NO: 40: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11IR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
TCAGGATGCT TACAATTACT TC 22 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11JF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CAAAATTGAA TGCTATGCTT AGA 

(2) INFORMATION FOR SEQ ID NO: 4 2: ' 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11JR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
TCGGTAACCC TGAGCCAAAT 
(2) INFORMATION FOR SEQ ID NO: 43: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11KF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GCAAAAGCGT CCAGAAAGGA 20 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11KR-1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

TATTTGCAGT CAAGTCTTCC AA 22 

(2) INFORMATION FOR SEQ ID NO:45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11LF-1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
GTAATATTGG CAAAGGCATC T 21 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 11LR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TAAAATGTGC TCCCCAAAAG CA 22 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 12F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GTCCTGCCAA TGAGAAGAAA 2 0 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 12R primer * 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TGTCAGCAAA CCTAAGAATG T 21 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 13F primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AATGGAAAGC TTCTCAAAGT A 21 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 13R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

ATGTTGGAGC TAGGTCCTTA C 21 

(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 14 F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

t 

CTAACCTGAA TTATCACTAT CA 22 
(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL. SOURCE : 
(B) STRAIN: 14R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 52 : 
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GTGTATAAAT GCCTGTATGC A 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: 15F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

TGGCTGCCCA GGAAGTATG 

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 15R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : 
AACCAGAATA TCTTTATGTA GGA 

(2) INFORMATION FOR SEQ ID NO: 55: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:' DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 16F primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
AATTCTTAAC AGAGACCAGA AC 
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 16R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

AAAACTCTTT CCAGAATGTT GT 22 

(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 17F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GTGTAGAACG TGCAGGATTG 2 0 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear ? 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 17R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 



TCGCCTCATG TGGTTTTA 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 18F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GGCTCTTTAG CTTCTTAGGA C 21 

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 18R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: SO: 
GAGACCATTT TCCCAGCATC 2 0 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE : 

(B) STRAIN: 19F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CTGTCATTCT TCCTGTGCTC 2 0 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 19R primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CATTGTTAAG GAAAGTGGTG C 21 

(2) INFORMATION FOR SEQ ID NO:63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 2 OF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
ATATGACGTG TCTGCTCCAC 2 0 

(2) INFORMATION FOR SEQ ID NO : 64 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 2 OR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGGAATCCAA ATTACACAGC 2 0 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 2 IF primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
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AAGCTCTTCC TTTTTGAAAG TC 22 

(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 21R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GTAGAGAAAT AGAATAGCCT CT 2 2 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 22F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
TCCCATTGAG AGGTCTTGCT 2 0 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 22R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

GAGAAGACTT CTGAGGCTAC 

(2) INFORMATION FOR SEQ ID NO: 69: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS :. not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 23F-1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
TGAAGTGACA GTTCCAGTAG T 21 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 23R-1 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
CATTTTAGCC ATTCATTCAA CAA 2 3 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear ^ 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: 24F primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

ATGAATTGAC ACTAATCTCT GC 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



93 



Patent 

Attorney Docket No. 5371.34.US01 



(vi) ORIGINAL SOURCE: 

(B) STRAIN: 24R primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GTAGCCAGGA CAGTAGAAGG A 
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